Managing Uncertainty in Schema Matching with Top-K Schema Mappings

نویسنده

  • Avigdor Gal
چکیده

In this paper, we propose to extend current practice in schema matching with the simultaneous use of top-K schema mappings rather than a single best mapping. This is a natural extension of existing methods (which can be considered to fall into the top-1 category), taking into account the imprecision inherent in the schema matching process. The essence of this method is the simultaneous generation and examination of K best schema mappings to identify useful mappings. The paper discusses efficient methods for generating top-K methods and propose a generic methodology for the simultaneous utilization of top-K mappings. We also propose a concrete heuristic that aims at improving precision at the cost of recall. We have tested the heuristic on real as well as synthetic data and anlyze the emricial results. The novelty of this paper lies in the robust extension of existing methods for schema matching, one that can gracefully accommodate less-thanperfect scenarios in which the exact mapping cannot be identified in a single iteration. Our proposal represents a step forward in achieving fully automated schema matching, which is currently semi-automated at best.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Improved Semantic Schema Matching Approach

Schema matching is a critical step in many applications, such as data warehouse loading, Online Analytical Process (OLAP), Data mining, semantic web [2] and schema integration. This task is defined for finding the semantic correspondences between elements of two schemas. Recently, schema matching has found considerable interest in both research and practice. In this paper, we present a new impr...

متن کامل

Schema Matching and Schema Merging based on Uncertain Semantic Mappings

This dissertation lies in the research area of schema integration: the problem of combining the data of different data sources by creating a unified representation of these data. Two core issues in schema integration are schema matching, i.e. the identification of correspondences, or mappings, between input schema objects, and schema merging, i.e. the creation of a unified schema based on the i...

متن کامل

Schema Meta - Matching Extended

Schema matching, the process of matching between concepts describing the meaning of data in heterogeneous, distributed data sources (e.g. database schemata, XML DTDs, HTML form tags, etc.) is one of the basic operations required by the process of data integration. Recently, several algorithms for automatic schema matching have been proposed and evaluated in the database community. While in many...

متن کامل

An Iterative Approach to Managing Uncertain Mappings in Dataspace Support Platforms

A DataSpace Support Platform (DSSP) is a self-sustained and self-managed system which needs to support uncertainty among its mediated schemas and its schema mappings. Some approaches for managing such uncertainty by assigning probabilities and reliability degrees to schema mappings have been proposed. Unfortunately, the number of mappings self-generated by a DSSP is usually too large and among ...

متن کامل

Self Organizing Semantic Topologies in Peer Database Systems

A semantic topology is a peer overlay network connected via semantic links, constructed using schema mappings and used for peer querying. The large-scale and dynamic environments of P2P networks dictate the use of automatic schema matching, which was shown to carry with it a degree of uncertainty, entailing uncertainty amplification over transitive mappings. Therefore, peers prefer network topo...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2006